Detection of laughter in children's speech using spectral and prosodic acoustic features
نویسندگان
چکیده
Laughter is an important para-linguistic cue that can be useful in gauging the affective state of the speaker. In this paper, we present an approach to detecting laughter in children’s speech using acoustic features in the spectral and prosodic domains. Feature selection was performed using the information gain-based technique and a speaker-independent validation using a support vector machine (SVM), an accuracy of 94.43% was observed, which was a 12.48% absolute improvement over the baseline result of 81.95%. For us to explore generalization properties, the models of speech and laughter were tested on a completely different database of adult-child interactions known as the Multimodal Dyadic Behavior Dataset (MDBD). The accuracy using the earlier trained models was 70.58%. Even though the children in this database were toddlers (less than three years old), the results suggest that the predictive power of the selected features generalizes well to different forms of children’s laughter.
منابع مشابه
Detection of children's paralinguistic events in interaction with caregivers
Paralinguistic cues in children’s speech convey the child’s affective state and can serve as important markers for the early detection of autism spectrum disorder (ASD). In this paper, we detect paralinguistic events, such as laughter and fussing/crying, along with toddlers’ speech from the Multi-modal Dyadic Behavior Dataset (MMDB). We use both spectral and prosodic acoustic features selected ...
متن کاملAutomatic discrimination between laughter and speech
Emotions can be recognized by audible paralinguistic cues in speech. By detecting these paralinguistic cues that can consist of laughter, a trembling voice, coughs, changes in the intonation contour etc., information about the speaker’s state and emotion can be revealed. This paper describes the development of a gender-independent laugh detector with the aim to enable automatic emotion recognit...
متن کاملThe effect of bilateral subthalamic nucleus deep brain stimulation (STN-DBS) on the acoustic and prosodic features in patients with Parkinson’s disease: A study protocol for the first trial on Iranian patients
Background: The effect of subthalamic nucleus deep brain stimulation (STN-DBS) on the voice features in Parkinson’s disease (PD) is controversial. No study has evaluated the voice features of PD underwent STN-DBS by the acoustic, perceptual, and patient-based assessments comprehensively. Furthermore, there is no study to investigate prosodic features before and after DBS in PD. The curren...
متن کاملA Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information
Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating them potentially can play an important role in transmitt...
متن کاملParalinguistic Analysis of Children's Speech in Natural Environments
Paralinguistic cues are the non-phonemic aspects of human speech that convey information about the affective state of the speaker. In children’s speech, these events are also important markers for the detection of early developmental disorders. Detecting these events in hours of audio data would be beneficial for clinicians to analyze the social behaviors of children. The chapter focuses on the...
متن کامل